Exploratory Data Analysis

We're going to explore a large data set or traffic crashes to learn about what factors are connected with injuries. We will use data from the city of Chicago's open data portal. (This activity is derived from a blog post by Julia Silge)

This dataset is pretty crazy! Let's do some data munging to get it into a nicer form.

Exploratory Data Analysis

Here's a few questions to get you started.

  1. Take a look at crashes by latitude and longitude, colored by injuries. What do you notice?

  2. What are the most common contributing factors to a crash?

  3. How do crashes vary month by month? Compare crashes by month in 2022 to 2023.

  4. Are crashes more likely to cause injuries when it is rainy and dark? Use the variables weather_condition and lighting_condition to explore.

  5. Choose a question you want to explore, and create an appropriate visual.